@pmatczak2 pmatczak2 commented Nov 6, 2024

  1. Enhanced Documentation:
    Added detailed comments throughout the _FitLoop class that explain the purpose and behavior of key methods, particularly setup_data(), on_run_start(), and on_advance_start().
  2. Clarified Logic for Data Loading:
    Explained why setup_data() is called more than once: the repeated calls ensure the data loaders are fresh for each epoch. The comments also describe the conditions under which the loaders are reloaded.
  3. Improved Readability:
    Added comments that give context for the flow of the code and the design decisions behind it, so future developers can follow the implementation more easily.
  4. Specific Method Highlights:
    • setup_data(): Documented its purpose in managing training data loaders and handling overfitting scenarios.
    • on_run_start(): Clarified its role in setting up validation data loaders and invoking relevant hooks.
    • on_advance_start(): Explained why setup_data() must be called to prepare for the current epoch.
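
For readers without the source open, the call pattern described above can be sketched with a toy loop. This is a minimal illustration under stated assumptions, not Lightning's _FitLoop: the class ToyFitLoop, its attributes, and the reload condition are hypothetical, and the reload_every_n_epochs argument is only loosely modeled on the Trainer option reload_dataloaders_every_n_epochs. It shows why calling setup_data() at the start of every epoch is cheap: the method returns early unless a reload is actually due.

```python
class ToyFitLoop:
    """Hypothetical stand-in for a fit loop; NOT Lightning's _FitLoop."""

    def __init__(self, make_train_loader, max_epochs,
                 reload_every_n_epochs=0, make_val_loader=None):
        self.make_train_loader = make_train_loader
        self.make_val_loader = make_val_loader
        self.max_epochs = max_epochs
        self.reload_every_n_epochs = reload_every_n_epochs
        self.current_epoch = 0
        self.train_loader = None
        self.val_loader = None
        self.last_reload_epoch = None
        self.reload_log = []  # epochs at which the train loader was (re)built

    def _should_reload(self):
        # Assumed condition: always load once, then reload every n epochs if n > 0.
        if self.train_loader is None:
            return True
        n = self.reload_every_n_epochs
        return n > 0 and self.current_epoch - self.last_reload_epoch >= n

    def setup_data(self):
        # Safe to call repeatedly: it is a no-op unless a reload is due.
        if not self._should_reload():
            return
        self.train_loader = self.make_train_loader(self.current_epoch)
        self.last_reload_epoch = self.current_epoch
        self.reload_log.append(self.current_epoch)

    def on_run_start(self):
        # Assumed role: prepare validation data once before the first epoch.
        if self.make_val_loader is not None and self.val_loader is None:
            self.val_loader = self.make_val_loader()

    def on_advance_start(self):
        # Called at the start of every epoch so the loader is current.
        self.setup_data()

    def advance(self):
        for _batch in self.train_loader:
            pass  # a real loop would run the training step here

    def run(self):
        self.setup_data()
        self.on_run_start()
        while self.current_epoch < self.max_epochs:
            self.on_advance_start()
            self.advance()
            self.current_epoch += 1
        return self.reload_log


def make_train_loader(epoch):
    # Hypothetical loader factory: three dummy batches tagged with the epoch.
    return [(epoch, i) for i in range(3)]


reloads_default = ToyFitLoop(make_train_loader, max_epochs=4).run()
reloads_every_2 = ToyFitLoop(make_train_loader, max_epochs=4,
                             reload_every_n_epochs=2).run()
```

With reloading disabled, the loader is built once before the first epoch; with reload_every_n_epochs=2, it is rebuilt at epochs 0 and 2, even though setup_data() runs at the start of every epoch.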

📚 Documentation preview 📚: https://pytorch-lightning--20402.org.readthedocs.build/en/20402/

awaelchli and others added 30 commits April 3, 2024 17:53
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <[email protected]>
…19852)

* distributed checkpoints

* use decorator

* refactor if-strict

* update example

* filter non-persistent buffers (todo, add test)

* simplify checkpoint loading for model
…oints (Lightning-AI#19870)

* memory-optimized loading of full checkpoints into dist model

* simplify

* handle buffers

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* handle strict loading, buffers, and add test

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* chlog

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
…ghtning-AI#19872)

* Load optimizer state

* move to utility

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* ModelParallelStrategy for Lightning Trainer

* mypy

* import fix

* fix torchscript errors

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix docs issue

* fix test execution

* Update src/lightning/pytorch/strategies/model_parallel.py

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Luca Antiga <[email protected]>
* Add 2D parallel example

* replace with torchtitan code
tshu-w and others added 12 commits September 30, 2024 18:08
* Add documentation note for TQDMProgressBar

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* upggrade requiremnets.txt

* update fabric bitsandbytes linear quantization for bnb 0.44.1

* add quant_storage param

* exclude macos from bnb upgrade

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* docs: fix removed ref to `deepspeed.initialize`

* fix links
…ning-AI#20354)

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <[email protected]>
update tutorials to `b83fde09`

Co-authored-by: Borda <[email protected]>
@github-actions github-actions bot added the docs, ci, fabric, pl, dependencies, dockers, package, store, app, and data labels Nov 6, 2024
Collaborator

lantiga commented Nov 12, 2024

Thank you @pmatczak2. Your PR targets the main branch, but it should target master. However, when I retarget it, it seems to have no commits.

I'll close it for the time being:

  • can you re-open it against master
  • make sure it contains the correct commits
  • change the PR title to reflect its content

thanks!

@lantiga lantiga closed this Nov 12, 2024